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TARGETED HETERO-ASSOCIATION OF RECOMBINANT 
PROTEINS TO MULTI-FUNCTIONAL COMPLEXES 

Background of the Invention 

Increasingly, there is a need for proteins which combine two or more functions, 
such as binding or catalysis, in a single structure. Typically, proteins which 
combine two or more functions are prepared either as fusion proteins or through 
chemical conjugation of the component functional domains. Both of these 
approaches suffer from disadvantages. Genetic "single chain" fusions suffer the 
disadvantages that (i) only a few (2-3) proteins can be fused (Rock et al., 1992, 
Prof. Eng. 5, 583-591), (ii) mutual interference between the component domains 
may hinder folding, and (iii) the size of the fusion protein may make it difficult to 
prepare. The alternative, chemical cross-linking in vitro following purification of 
independently expressed proteins, is difficult to control and invariably leads to 
undefined products and to a severe loss in yield of functional material. 
Recently, methods for achieving non-covalent association of two or more of the 
same functional domains have been developed. This can be achieved through 
the use of domains attached to peptides which self-associate to form hbmo- 
multimers (Pack & Pluckthun, 1992, Biochemistry 31, 1579-1584). For example, 
the association of two separately expressed scFv antibody fragments by C- 
terminally fused amphipathic helices in vivo provides homo-dimers of antibody 
fragments in £ coJi (PCT/EP93/00082; Pack et al., 1993, BiofTechnology 11, 
1271-1277) or homo-tetramers;(Packet al., 1995, J. Mol. Biol., 246, 28-34). 
To assemble distinct protein functions such as two antibody fragments with 
different specificities fused to such association domains, the helices must have a 
tendency to form hetero-multimers. In principle, this could be achieved with 
complementary helices such as the hetero-dimerizing JUN and FOS zippers of 
the AP-1 transcription factor (O'Shea et al., 1992, Ceil 68, 699-708). The clear 
disadvantage of association domains based on hetero-associating helices, 
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however, is their pseudo-symmetry and their similar periodicity of hydrophobic 
and hydrophilic residues. This structural similarity results in a strong tendency to 
form homo-dimers and, thus, to lower significantly the yield of hetero-dimers 
(O'Shea et al., 1992, Cell 68, 699-708; Pack, 1994, Ph. D. thesis, Ludwig- 
Maximilians-Universitat Munchen). Furthermore, the formation of JUN/FOS 
hetero-dimers is kinetically disfavoured and requires a temperature-dependent 
unfolding of the kinetically favoured homo-dimers, especially JUN/JUN homo- 
dimers (PCT/EP93/00082; O'Shea et al., 1992, Cell 68. 699-708; Pack, 1994, Ph. 
D. thesis, Ludwig-Maximilians-Universitat Munchen). Because of the need for 
additional purification steps to separate the unwanted homo-dimers from hetero- 
dimers and the resulting decrease in yield, hetero-association domains based on 
amphipathic helices do not result in practical advantages compared to 
conventional chemical coupling. 

These disadvantages of the prior art are overcome by the present invention 
which provides multi-functional polypeptides and methods for the preparation of 
these multi-functional proteins. This is achieved via the use of association 
domains which are designed to associate predominantly in a complementary 
fashion, and not to self-associate. 

Detailed Description of the Invention 

In the earliest steps of protein folding, peptide chains form a disordered 
hydrophobic core by collapsing hydrophobic residues into the interior of an 
intermediate "molten globule". This hydrophobic effect is considered to be the 
most important driving force of folding (Matthews, 1993, Annu. Rev. Biochem. 62, 
653 - 683; Fersht, 1993, FEBS Letters 325, 5- 16). The burial of hydrophobic 
residues and the resulting exclusion of solvent is the determining factor in the 
stability of compact tertiary structures such as acyl-phosphatase (Pastore et al, J. 
Mol. Biol. 224, 427-440, 1992) interleukin-2 (Brandhuber et al., 1987, Science 
238, 1707 - 1709), calbindin (Parmentier, 1990, Adv. Exp. Med. Biol. 269, 27-34) 
or ubiquitin (Briggs & Roder, 1992, Proc. Natl. Acad. So. USA 89, 2017 - 2021). 



WO 96/13583 



PCT/EP95/04117 



3 

This concept forms the basis of the present invention, which provides individually 
encoded peptides or "segments* which, in a single continuous chain, would 
comprise a compact tertiary structure with a highly hydrophobic core. The 
component peptides are chosen so as to be asymmetric in their assumed 
structure, so as not to self-associate to form ' homo-multimers, but rather to 
associate in a complementary fashion, adopting a stable complex which 
resembles the parent tertiary structure/On the genetic level, these segments are 
encoded by interchangeable cassettes with suitable restriction sites. These 
standardized cassettes are fused C- or N-terminally to different recombinant 
proteins via a linker or hinge in a suitable expression vector system. 

Thus, the present invention relates to a multi-functional polypeptide comprising: 

(a) a first amino acid sequence attached to at least one functional domain; 

(b) a second amino acid sequence attached to at least one further functional 
domain; and 

(c) optionally, further amino acid sequences each attached to at least one 
further functional domain; 

wherein any one or more of said amino acid sequences interacts with at least one 
of said amino acid sequences in a complementary fashion to form a parental, 
native-like tertiary or optionally quaternary structure and wherein the parental, 
native-like tertiary or optionally quaternary structure is derived from a single 
parent polypeptide. In this context, the term parent polypeptide refers to a 
polypeptide which has a compact tertiary or quarternary structure with a 
hydrophobic core. The invention provides for many different parent polypeptides 
to be used as the basis for the association domain. Suitable polypeptides can be 
identified by searching for compact, single-domain proteins or protein fragments 
in the database of known protein structures (Protein Data Bank, PDB) and 
selecting structures that are stable and can be expressed at high yields in 
recombinant form. These structures can then be analyzed for hydrophobic sub- 
clusters by the method of Karpeisky and llyn (1992, J. Mot. Biol. 224 % 629-638) or 
for structural units (such as ^-elements or helical hairpin structures) by standard 
molecular modelling techniques. In a further embodiment, the present invention 
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provides for multi-functional polypeptides wherein the single parent polypeptide is 
taken from the list ubiquitin, acyl phosphatase, IL-2, calbindin and myoglobin. 

In a preferred embodiment, the present invention provides a multi-functional 
polypeptide comprising two or more amino acid sequences each attached to at 
least one functional domain, wherein any two or more of said amino acid 
sequences can associate in a complementary fashion to provide a parental, 
native like, tertiary or optionally quaternary structure. 

Once structural sub-domains are identified, the protein is dissected in such a way 
these sub-domains remain intact. The selection process can be expanded to 
proteins for which no structure is available but which satisfy the criteria of stability 
and good expression. For these proteins, folding sub-domains can be determined 
by hydrogen exchange pulse-labelling of backbone amides during the folding 
reaction, followed by NMR detection in the native state (Roder et al., 1988, 
Nature 355, 700-704; Udgaonkar & Baldwin, 1988, Science 255, 594-597). 
Alternatively, folding sub-domains can be identified by mild proteolysis, 
denaturation, purification of fragments and reconstitution in vitro (Tasayco & 
Carey, 1992, Science 255, 594-597; Wu et al., 1993, Biochemistry 32, 10271- 
10276). Finally, additional clues for the choice of cleavage sites can be obtained 
from the exon structure in the case of eukaryotic proteins, since the exons 
frequently (though not always) correspond to structural sub-domains of a protein. 
This has, for example, been discussed for the case of myoglobin (Go 1981, 
Nature 291, 90). 

The yield of properly assembled molecules is expected to decrease significantly 
for constructs in which a protein domain is divided into three or more parts. This 
is due to the fact that several sub-domains must come together simultaneously to 
form a viable structure. This effect is countered by dividing the polypeptide chain 
into sub-domains that represent folding units (identified by the methods described 
above). Thus, not only the final, assembled complex but also assembly 
intermediates will have the stability necessary to allow their accumulation in the 



WO 96/13583 



PCIYEM5/04117 



5 

host during expression, resulting in a greatly improved kinetic behaviour of the 
system. 

In solution, the isolated segments have little secondary structure and remain 
monomeric or form transient, non-specific and easily disrupted aggregates. Only 
upon mixing, either by separate expression and purification, or by co-expression, 
can the concerted folding of complementary segments provide the necessary 
intermediate interaction of residues (Matthews, 1993, Anna. Rev. Biochem. 62, 
653 - 683) that results in the formation of a compact, native-like structure/This 
association, mainly driven by the burial of hydrophobic residues of all segments 
into a single hydrophobic core, leads to a targeted assembly of the N- or C- 
terminally fused proteins to a multi-functional complex in vivo or in vitro. 
Optionally, the reconstituted native-like structure may also contribute an 
enzymatic or binding activity to increase the number of effector functions in the 
assembled complex. Accordingly, the present invention also provides a multi- 
functional polypeptide as described above, in which the native-like, tertiary or 
quaternary structure provides a biological activity. For example, when acyl 
phosphatase is used as the basis of the association domain, it is expected that 
the multi-functional polypeptide will retain some phosphatase activity. 
The present invention provides for many different types of functional domains to 
be linked into the multi-functional polypeptide. Particularly preferred are cases in 
which one or more, preferably two, of said functional domains are fragments 
derived from molecules of the immunoglobulin superfamily. In particularly 
preferred embodiments, said fragments are antibody fragments. Also preferred 
are cases in which at least , one of the functional domains possesses biological 
activity other than that associated with a fragment derived from a member of the 
immunoglobulin superfamily. By way of example, the present invention provides 
for the targeted assembly of enzymes, toxins, cytokines, peptide hormones, 
immunoglobulins, metal binding domains, soluble receptors, lectins, lipoproteins, 
purification tails and bioactive peptides to multi-functional complexes (Fig. 1) 
based on a modular system of expression vectors, restriction sites and "plug-in" 
gene cassettes coding for assembly segments, peptide linkers and functional 
domains (Fig.2). 
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If covalent linkage between the segments is necessary to prevent dissociation at 
low concentrations, cysteines can be introduced to form inter-segmental 
disulphide bridges between the amino acid sequences which comprise the 
association domain (Ecker et al., 1989, J. Biol. Chem. 264, 1887-1893; Pack & 
Pluckthun, 1992, Biochemistry 31, 1579-1584). Accordingly, the present invention 
provides multi-functional polypeptides wherein the folding of the component 
amino acid sequences is stabilized by a covalent bond. 

In order to provide some flexibility between the association domain and the 
appended functional domains, it may be desired to incorporate a linker peptide. 
Accordingly, the present invention provides for multi-functional polypeptides of 
the type described above wherein at least one of the functional domains is 
coupled to said amino acid sequence via a flexible peptide linker. By way of 
example, the flexible linker may be derived from the hinge region of an antibody. 
The invention enables even more complex multi-functional polypeptides to be 
constructed via the attachment of at least one further (poly)peptide to one or 
more of said amino acid sequences. By way of example, the further (poly)peptide 
can be taken from the list enzymes, toxins, cytokines, peptide hormones, 
immunoglobulins, metal binding domains, soluble receptors, lectins, lipoproteins, 
purification tails, in particular peptides which are able to bind to an independent 
binding entity, bioactive peptides, preferably of 5 to 15 amino acid residues, 
metal binding proteins, DNA binding domains, transcription factors and growth 
factors. 

For therapeutic purposes, it is often desirable that proteinaceous substances 
display the minimum possible immunogenicity. Accordingly, the present invention 
provides for multi-functional polypeptides as described above in which at least 
one of said amino acid sequences, functional domains, or further (poly)peptides 
is of human origin. 

In addition to the peptides and proteins provided above, the present invention 
also provides for DNA sequences, vectors, preferably bicistronic vectors, vector 
cassettes, characterised in that they comprise a DNA sequence encoding an 
amino acid sequence and optionally at least one further (poly)peptide comprised 
in the multifunctional polypeptide of the invention, and additionally at least one, 
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preferably singular cloning sites for inserting the DNA encoding at least one 
further functional domain or that they comprise DNA sequences encoding the 
amino acid sequences, and optionally the further (poly)peptide(s) comprised in 
the multifunctional polypeptide of the invention and suitable restriction sites for 
the cloning of DNA sequences encoding the functional domains, such that upon 
expression of the DNA sequences after the insertion of the DNA sequences 
encoding the functional domains into said restriction sites, in a suitable host the 

i . t 

multifunctional polypeptide of the invention is formed. In a preferred embodiment 
said vector cassette is characterised in that it comprises the inserted DNA 
sequence(s) encoding said functional domain(s) and host cells transformed with 
at least one vector or vector cassette of the invention which can be used for the 
preparation of said multi-functional polypeptides. 

In a further preferred embodiment, said host cell is a mammalian, preferably 
human, yeast, insect, plant or bacterial, preferably E. coli cell. 

The invention further provides for a method for the production of a multifunctional 
polypeptide of the invention, which comprises culturing the host cell of the 
invention in a suitable medium, and recovering said multifunctional polypeptide 
produced by said host cell. 

In a further embodiment, the invention relates to a method for the production of a 
multifunctional polypeptide of the invention which comprises culturing at least two 
host cells of the invention in a suitable medium, said host cells each producing 
only one of said first and said second amino acid sequences attached to at least 
one further functional domain, recovering the amino acid sequences, mixing 
thereof under mildly denaturing conditions and allowing in vitro folding of the 
multifunctional polypeptide of the invention from said amino acid sequences. 

In a particular preferred embodiment, said method is characterised in that the 
further amino acid sequences attached to at least one further functional domain 
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are/is produced by at least one further host cell not producing said first or second 
amino acid sequence. 

In ianother particularly preferred embodiment of the invention, said method is 
characterised in that at least one further amino acid sequence attached to at 
least one further functional domain is produced by the host cell of the invention 
producing said first or second amino acid sequence. 

In further preferred embodiments, the present invention provides for 
pharmaceutical and diagnostic compositions comprising the multi-functional 
polypeptides described above, said pharmaceutical compositions optionally 
comprising a pharmaceutical^ acceptable carrier. Finally, the invention provides 
for a kit comprising one or more vector cassettes useful in the preparation of said 
multi-functional polypeptides. 

The invention is now illustrated by reference to the following examples, which are 
provided for the purposes of illustration only and are not intended to limit the 
scope of the invention. 
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Example 1: Segmented human ubiquitin as an assembly device 

Ubiquitin is a compact intracellular protein of only 76 residues (Fig. 3) and a 
molecular weight of 5 kDa. It shows the highest conservation among all known 
proteins and is involved in the degradation pathway of intracellular eukaryotic 
proteins by forming intermediate isopeptide bonds to its C-terminus and to Lys48 
(Hershko & Ciechanover, 1992, Ann. Rev. Biochem: 61, 761-807). 
To use ubiquitin as an assembly device, the unwanted function can be abolished 
by truncation of the last three C-terminal residues (-Arg-Gly-Gly), and the 
exchange of Lys48 to Arg, which prevents the formation of isopeptide bonds to 
this residue. The altered sequence is then divided in a loop at position Gly36, so 
that the hydrophobic core falls apart into two segments (called ALPHA and 
BETA). The synthetic nucleotide sequence of the segments (Fig 4, 5) carry 
appropriate restriction sites (Mrol-Hindlll) at the termini, so that the cassette 
encoding the segments can be easily ligated to a EcoRI-Mrol cassette encoding 
the flexible linker (hinge of hulgG3; Fig. 6). The cassettes are inserted into the 
expression vector plG3 (EcoRI-Hindlll; Fig. 7) encoding the scFv fragment of the 
antibody McPC603 under the lac promoter/operator (Ge et al., 1995, in: Antibody 
engineering: A practical approach. IRL Press, New York, Borrebaeck ed., 229- 
261). Insertion of a second functional fragment (scFv fragment of the anti-B- 
lactam antibody 2H10 with phoA signal sequence) linked to association segment 
BETA as an Xbal-Hindlll DNA fragment (Fig. 8) results in a di-cistronic 
expression vector (Pack, 1994, Ph. D. thesis, Ludwig-Maximilians-Universitat 
Munchen). After induction with IPTG and translation, the signal sequences guide 
the antibody fragments fused to the assembly segments to the periplasm, where 
they assemble to a complex with a reconstituted native-like ubiquitin fold and two 
different antibody specificities. The complex, a bispec'rfic immunoglobulin, can be 
recovered and purified by affinity chromatography of cell extract (Pack, 1994, Ph. 
D. thesis, Ludwig-Maximilians-Universitat Munchen). 
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Example 2: Covalent linkage of the native-like tertiary structure of 
the assembly device by engineered disulphide bridges and 
combination of a C-terminal peptide linker with an in-frame 
restriction site. 

The conformational stability of undivided, native ubiquitin can be enhanced by 
introduction of disulfides at positions 4 and 66 without perturbation in the 
backbone (Ecker et aL, 1989, J. Biol. Chem. 264 t 1887-1893; Fig. 9). In the 
context of this invention, the engineering of disulfide bridges provides the 
covalent linkage of segments (Fig. 10, 11) after co-folding and assembly. 
To raise the number of possible functional domains in the assembled complex, a 
C-terminal peptide can be fused to one or more of the segments of the assembly 
device. To fuse a functional domain like an enzyme, cytokine, antibody fragment, 
purification peptide or toxin to this linker, a restriction site, preferably unique, has 
to be introduced in-frame (Fig. 11). Gene synthesis, cloning, expression as well 
as recovery of the assembled, covalently linked complex is according to example 
1. 

Example 3: Segmented human interleukin-2 (IL2) as an assembly 
device 

Human interleukin-2 (Brandhuber et al„ 1987, Science 238, 1707 - 1709; Kuziel 
& Greene, 1991, in: The Cytokine Handbook. Academic Press, 84-100) is used 
as an assembly device by segmentation between position His79 and Lys 80 (Fig. 
12). The device, encoded by Mrol-Ascl-Hindll gene cassettes (Fig. 13, 14) 
combines the low immunogenicity of the plasmatic protein with a preferable 
effector function of the native-like cytokine structure and ah intersegmental 
cysteine bridge (Cys58-Cys105) after assembly. The combination of one or more 
antibody fragments against tumor antigens with additional cytokines like IL6 or 
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IL12 targets the multi-cytokine complex (Rock et al., .1992, Prot. Eng. 5, 583-591) 
directly to the tumour. 

i 

Example 4 : Segmented human apomyoglobin as an assembly 
device with three segments 

i 

To use more than two segments of a native structure as an assembly device, the 
hydrophobic interface between the segments has to be large enough to provide 
the sufficient hydrophobic interaction for non-covalent linkage. Myoglobin 
(Fig.15) is expressible in large amounts in E. coli (Guillemette et al., 1991, 
Protein Eng. 4, 585-592). Up to six functional domains can be assembled by a 
threefold segmented structure (Fig. 16, 17, 18), three at the N-termini and three 
at the C-termini of the segments. The presence of heme additionally stabilizes 
the native-like apomyoglobin fold and can be used as a switch to influence the 
association constant of the multi-functional complex. 

Example 5: Bioactive peptides as functional domains 

Certain peptides derived from amphipathic loop structures of LPS-binding 
proteins (Hbess et al., 1993, EMBO J. 12, 3351-3356) are able to neutralize 
endotoxin. This effect is enhanced by multivalent display of these short peptides 
(10-15 residues; Hoess, unpublished results). The present invention provides a 
method to express and assemble several of short peptides (Fig. 19), fused to an 
assembly segment, in a multivalent complex or in combination with other 
functional domains. The peptides can be fused either to the N-or to the C- 
terminus (Fig. 20, 21) of the assembly domain via the peptide linkers. 
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Example 6: A purification tail for 1MAC as a functional domain 

Peptide tails consisting of histidines are able to coordinate metal ions. They are 
used for purification of native proteins in immobilized metal affinity 
chromatography (IMAC). Multivalent display of the purification tail considerably 
improves the maximum purity achievable by IMAC (Lindner et al„ 1992, Methods: 
a companion to methods in enzymology 4, 41-56). One or more gene cassettes 
(Fig. 22) encoding a polyhistidine tail can be fused to the assembly segment to 
provide a simple and efficient purification method for multi-functional complexes. 

Example 7: The platelet aggregation inhibitor decorsin as a 
functional domain 

Decorsin, a 39 residue protein of the leech Macrobdelta decora (Fig. 23), acts as 
a potent antagonist of the platelet glycoprotein llb-llla (Seymour et al., 1990, J. 
Biol. Chem. 265, 10143-10147). The gene cassette encoding the decorsin can be 
fused C- or N-terminally to an association segment (Fig. 24, 25). In arterial 
thrombotic deseases, a multivalent decorsin complex combined with an anti-fibrin 
antibody fragment can act as a powerful antithrombotic agent. 
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Claims 

, 1. A multifunctional polypeptide comprising: 

(a) a first amino acid sequence attached to at least one functional 
domain; 

(b) a second amino acid sequence attached to at least one further 
functional domain; and 

(c) optionally, further amino acid sequences each attached to at 
least one further functional domain; 

wherein any one or more of said amino acid sequences interacts with at 
least one of said amino acid sequences in a complementary fashion to 
form a parental, native-like tertiary or optionally quaternary structure and 
wherein said parental, native-like tertiary or optionally quaternary 
structure is derived from a single parent polypeptide. 

2. The multifunctional polypeptide according to claim 1 , wherein said single 
parent polypeptide is ubiquitin, acyl-phosphatase, IL2, calbindin or 
apomyoglobin. 

3. The multifunctional polypeptide according to claim 1 or 2, wherein said 
parental, native-like tertiary or optionally quaternary structure is 
biologically active. 

4. The multifunctional polypeptide according to any one of claims 1 to 3, 
wherein at least one of said functional domains is a fragment derived 
from a member of the immunoglobulin superfamily. 

5. The multifunctional polypeptide according to claim 4, wherein two of said 
functional domains are fragments derived from members of the 
immunoglobulin superfamily. 
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6. The multifunctional polypeptide according to claim 4 or 5, wherein said 
fragments are antibody fragments. 

7. The multifunctional polypeptide according to any one of claims 1 to 6, 
wherein at least one of said functional domains is a biologically active 
molecule or a derivative thereof other than a fragment derived from a 
member of the immunoglobulin superfamily. 

8. The multifunctional polypeptide according to any one of claims 1 to 6, 
wherein the folding of the amino acid sequences is stabilised by a 
covalent bonding. 

9. The multifunctional polypeptide according to any one of claims 1 to 8, 
wherein at least one of said functional domains is coupled to said amino 
acid sequence(s) via a flexible peptide linker. 

10. The multifunctional polypeptide according to claim 9, wherein said 
flexible peptide linker is an antibody hinge region. 

11. The multifunctional polypeptide according to any one of calims 1 to 10, 
wherein at least one of said amino acid sequences is coupled to at least 
one further (poly)peptide. 

12. The multifunctional polypeptide according to claim 11, wherein said 
further (poly)peptide is an enzyme, a toxin, a cytokine, a metal binding 
site, a metal binding protein, a soluble receptor, a DNA-binding domain, 
a transcription factor, an immunoglobulin, a bioactive peptide of 5 to 15 
amino acid residues, a peptide hormone, a growth factor, a lectin, a 
lipoprotein, and a peptide which is able to bind to an independent 
binding entity. 
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13. The multifunctional polypeptide according to any one of claims 1 to 12, 
wherein at least one of said amino acid sequences, functional domains 
or further (poly)peptide(s) is of human origin. 

I 

14. A DNA sequence encoding an amino acid sequence and at least one 
functional domain and, optionally, at least one further functional 
(poly)peptide comprised in the multifunctional polypeptide of any one of 
claims 1 to 13. 

15. A vector comprising at least one DNA molecule of claim 14. 

16. The vector of claim 1 5, which is a bicistronic vector. 

17. A vector cassette characterised in that it comprises a DNA sequence 
encoding an amino acid sequence and optionally at least one further 
(poly)peptide comprised in the multifunctional polypeptide of any one of 
claims 1 to 13, and additionally at least one, preferably a singular cloning 
site for inserting the DNA encoding at least one further functional 
domain. 

18. A vector cassette characterised in that it comprises DNA sequences 
encoding the amino acid sequences, and optionally the further 
(poly)peptide(s) comprised in the multifunctional polypeptide of any one 
of claims 1 to 13, and suitable restriction sites for the cloning of DNA 
sequences encoding the functional domains, such that upon expression 
of the DNA sequences after the insertion of the DNA sequences 
encoding the functional domains into said restriction sites, in a suitable 
host the multifunctional polypeptide according to any one of claims 1 to 
13 is formed. 
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19. The vector cassette according to claim 17 or 18 characterised in that it 
comprises the inserted DNA sequence(s) encoding said functional 
domain(s). 

20. A host cell transformed with at least one vector according to claim 15 or 
16, or at least one vector cassette according to claim 19. 

21. The host cell according to claim 20, which is a mammalian, perferably 
human, yeast, insect, plant or bacterial, preferably E. coli cell. 

22. A method for the production of a multifunctional polypeptide according to 
any one of claims 1 to 13, which comprises culturing the host cell 
according to claim 20 or 21 in a suitable medium, and recovering said 
multifunctional polypeptide produced by said host cell. 

23. A method for the production of a multifunctional polypeptide according to 
any one of claims 1 to 13 which comprises culturing at least two host 
cells according to claim 20 or 21 in a suitable medium, said host cells 
each producing only one of said first and said second amino acid 
sequences attached to at least one further functional domain, recovering 
the amino acid sequences, mixing thereof under mildly denaturing 
conditions and allowing in vitro folding of the multifunctional polypeptide 
according to any one of claims 1 to 13 from said amino acid sequences. 

24. The method according to claim 23, wherein the further amino acid 
sequence(s) (each) attached to at least one further functional domain 
are/is produced by at least one further host cell not producing said first 
or second amino acid sequence. 
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25. The method according to claim 23, wherein at least one further amino 
acid sequence attached to at least one further functional domain is 
produced by the host cell according to claim 20 or 21 producing said first 

( or second amino acid sequence. 

26. A pharmaceutical composition comprising the multifunctional polypeptide 
according to any one of claims 1 to 13 optionally in combination with a 

i 

pharmaceutical^ acceptable carrier. 

27. A diagnostic composition comprising the multifunctional polypeptide 
according to any one of claims 1 to 13. 

28. A kit comprising at least one vector cassette according to claim 17 or 18. 
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Fig.l: segmented tertiary structure for a targeted hetero-association 
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Pig. 2 : Modular cistron encoding functional domains 

N- and/ or C- terminally fused to the assembly device 




reenREO SHEET (RULE 91} 
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Pig. 3 : protein sequence of human ubiquitin (segmented 
after Gly35) 



1 10 20 30 40 50 

MQIFVKTLTG KTITLEVEPS DTIENVKAKI QDKEGIPPDQ QRLIFAGKQL 



60 70 
EDGRTLSDYN IQKESTLHLV LRLRGG* 



Fig. 4: Mrol-Hind III gene cassette encoding for segment 
ALPHA of ubiquitin 



*i FOl C MQ IFVKTLTGKTITLE 
TCC GGA ATG CAG ATC TTC GTT AAA ACC CTC ACC GGT AAA ACC ATC ACC CTC GAA 
9 IB 27 36 45 54 

AGG CCT TAC GTC TAG AAG CAA TTT TGO GAC TGC CCA TTT TGG TAG TCC GAC CTT 

VEPS DTIBNVKAKIQDKE 
CTT GAA COG TCT GAC ACC ATC GAA AAC CTT AAA CCT AAA ATC CAG GAC AAA GAA 
63 72 81 90 99 108 

CAA CTT GGC AGA CTG TGG TAG CTT TTG CAA TTT CGA TTT TAG GTC CTG TTT CTT 

Hiodlll 
G * * A 

GGT TGA TAA GCT T 3 ' 
117 

CCA ACT ATT CGA A 5' 
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Fig. 5: Mrol-Hind III gene cassette encoding for segment 
BETA of ubiquitin 



S G IPP DQQRLI'FA.GRQLE 
TCC GGA ATC CCG CCG CAC CAG CAG CGT CTG ATC TTC CCT GGT CCT CAG CTG GAA 
9 18 27 36 45 54 

AGG CCT TAG GGC GGC CTG GTC GTC GCA GAC TAG AAG CGA CCA GCA GTC GAC CTT 



DCRTLSDYHIQKESTLH L 
GAC GGT CGT ACC CTG TCT GAC TAC AAC ATC CAG AAA GAA TCT ACC CTG CAC CTG 
63 72 81 90 99 108 

CTG CCA GCA TCC GAC AGA CTG ATG TTG TAG GTC TTT CTT AGA TGG GAC GTG GAC 



Hindlll 

V L R I* * * 
GTT CTG CGT CTG TGA TAA 3 ' 

117 126 
CAA GAC GCA GAC ACT ATT 5' 



Fig. 6: EcoRI-MroI gene cassette encoding a flexible linker 
(hu!gG3 ) 



EcoRI M r o i 

EFTPLCDTTHTSG 
5 • GAA TTC ACC CCG CTG GGT GAC ACC ACC CAC ACC TCC GGA 3 ' 

9 18 27 36 

3 • CTT AAG TGG GGC GAC CCA CTG TGG TGG GTG TGG AGG CCT 5 ' 
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Fig. 8: Construction of dicistronic co-expression vectors 




insert second cislron (XbaI/HindHI-digested,blunt) 
into vector (Xbal-digested, blunt) 
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Fig 9 : protein sequence of human ubiquitin with 
intersegmental disufides Cys4 and Cys66 (segmented aft< 
Gly35) 

1 10 20 30 40 50 

MQICVKTLTG KTITLEVEPS DTIENVKAKI QDKEGIPPDQ QRLIFAGKQL 



60 70 
EDGRTLSDYN IQKESCLHLV LRLRGG** 



Fig. 10: Mrol-Hind III gene cassette encoding for segment 
ALPHA-CYS4 of ubiquitin 



MQ ICVKTLTCKTITLE 
TCC GGA ATG CAO ATC TCC GTT AAA ACC CTC ACC GOT AAA ACC ATC ACC CTG GAA 
9 IB 27 36 45 S« 

AGG CCT TAC CTC TAG XCO CAA TTT TGG GAC TGG CCA TTT TOG TAG TGG GAC CTT 



VEPSDTIENVKAKIQDKE 
GTT GAA CCG TCT GAC ACC ATC GAA AAC GTT AAA GCT AAA ATC CAG GAC AAA GAA 
63 72 81 90 99 108 

CAA CTT GGC AGA CTG TGG TAG CTT TTG CAA TTT CGA TTT TAG GTC CTG TTT CTT 



Blndlll 
G * * A 

GGT TGA TAA GCT T 3 ' 
117 

CCA ACT ATT CGA A 5' 
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Fig. 11 : Mrol-Ascl-Hind III gene cassette encoding for 
segment BETA-CYS66 with C-terminal GGSGGAP linker of 
ubiquitin 

I ,• 

I P P" DQQRLI FACRQLE 
TCC GGA ATC CCG CCG GAC CAC CAG COT CTC ATC TTC GCT GGT CCT CAG CTG GAA 
9 18 27 36 45 54 

AGG CCT TAG GGC GGC CTG GTC GTC GCA GAC TAG AAG CGA CCA GCA GTC GAC CTT 



D G R T L S D Y N I Q K E S c " *V 

GAC GGT CGT ACC CTG TCT GAC TAC AAC ATC CAG AAA GAA TCT TCC CTG CAC CTG 
63 72 Bl 90 99 1°B 

CTG CCA GCA TOG GAC AGA CTG ATG TTG TAG CTC TTT CTT AGA ACQ GAC CTG GAC 



AscX HlndXXI 
VLRL'OOSOCAS** 
GTT CTC CGT CTC OOO GOO ACC CGA GGC CCO CCC TGA TAA 3 

CAA GAC GCA GAC CCC CCC TCC CCT CCO CCC GGC ACT ATT 5 



Pig. 12: Protein sequence of human IL-2 (segmented afte: 
His79) 

10 20 30 40 50 60 

APTSSSTKKT QLQLEHLLLD LQMILNGINN YKNPKLTRML TTKFYMPKKA TELKHLQCLE 

70 90 100 HO 120 

EELKPLEEVL NLAQSKNFHL RPRDLISNIN VIVLELKGSE TTFMCEYADE TATIVEFLNR 



130 

WITFCQSIIS TLT 
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Fig. 13 : Mrol-Ascl-Hind III 
segment ALPHA of human IL-2 
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M 
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L 


TTA 
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CAG 


ATG 


ATT 
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63 






72 






81 


AAT 


GAC 


GAC 


CTA 


AAT 


GTC 


TAC 


TAA 


AAC 



gene cassette encoding for 



K 


K 


T 


Q 


L 


Q 


L 


E 


H 


AAG 


AAA 


ACA 


CAG 


CTA 


CAA 


CTG 


GAG 


CAT 






36 






45 






54 


TTC 


TTT 


TGT 


GTC 


GAT 


GTT 


GAC 


CTC 


CTA 


N 


G 


I 


N 


N 


Y 


K 


N 


P 


AAT 


GGA 


ATT 


AAT 


AAT 


TAC 


AAG 


AAT 


ccc 






90 






99 






108 


TTA 


CCT 


TAA 


TTA 


TTA 


ATG 


TTC 


TTA CGG 



KLTRMLTFKFYMPKKATE 
AAA CTC ACC AGG ATG CTC ACA TTT AAG TTT TAC ATG CCC AAC AAG GCC ACA GAA 
117 126 135 144 153 162 

TTT GAG TOG TCC TAC GAG TGT AAA TTC AAA ATG TAC GGG TTC TTC CGG TGT CTT 



LKHLQCLEEELKPLEEVL 
CTG AAA CAT CTT CAG TGT CTA GAA GAA GAA CTC AAA CCT CTG GAG GAA GTG CTA 
171 1B0 189 198 207 216 

GAC TTT GTA GAA GTC ACA GAT CTT CTT CTT GAG TTT GGA GAC CTC CTT CAC GAT 



A«cX HisdIIZ 

ULAQSKNFHGGSGGAP* 
AAT TTA GCT CAA AGC AAA AAC TTT CAC GGG GGG AGC GGA GGC GCG CCG TGA T 

225 234 243 252 261 

TTA AAT CGA GTT TCG TTT TTG AAA GTG CCC CCC TOG CCT CCG CGC GGC ACT A 



♦ 
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Pig. 14 : Mrol-Ascl-Hind III gene cassette encoding for 
segment BETA of human IL-2 



I 



S T °l L R P R D L I S N I N V I V I. B 

TCC GGA TTA AGA CCC AGG GAC TTA ATC AGC AAT ATC AAC GTA ATA GTT CTG GAA 
9 18 27 .36 45 5* 

AGG CCT AAT TCT GGG TCC CTG AAT TAG TOG TTA TAG TTG CAT TAT CAA GAC CTT 



LKGSETTFMCEYADETAT 
CTA AAG GGA TCT GAA ACA ACA TTC ATG TCT GAA TAT GCT GAT GAG ACA GCA ACC 
63 72 81 90 99 1° B 

CAT TTC CCT AGA CTT TGT TCT AAC TAC ACA CTT ATA CGA CTA CTC TGT CGT TGG 



IVEPLNRWITFC QSI 1 ' * * 
ATT GTA CAA TTT CTG AAC AGA TGG ATT ACC TTT TCT CAA AGC ATC ATC TCA ACA 
U7 126 135 144 153 162 

TAA CAT CTT AAA GAC TTG TCT ACC TAA TGG AAA ACA GTT TCC TAG TAG ACT TGT 



AocI HindXIX 
LT GGSGGAP* 
CTG ACT GGG GGG AGC GGA GGC GCG CCG TGA T 3 * 

171 180 189 

GAC TGA CCC CCC TOG CCT CCG CGC GGC ACT A S' 
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Fig. 15 Protein sequence of human apomyoglobin (cut after 
Lys47 and Lys98) 

vlnvwgkvea dipghgqevl irlfkghpet lekfdkfkhl 
dlkkhgatvl talggilkkk ghheaeikpl aqshatkhki 
ciigvlqskh pgdfgadaeg amnkalelfr kdmasnykel 

151 
gfqg 



mglsdgewql 
51 

ksedemkase 
101 

pvkylef ise 



Fig. 16 : MroX-Asd-Hind III gene cassette encoding for 
segment ALPHA of human apomyoglobin 
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CTG 
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GTT 
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18 






27 






36 
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AGG 


CCT 


TAC 
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GAC 


AGA 


CTG 


CCA 


CTT 


ACC 


GTC 


GAC 


CAA 


GAC 


TTG 


CAA 



W G 
TGG GGT 
54 



KVEADI PGHGQEVLIRLF 
AAA GTT GAA GCT GAC ATC CCG GGT CAC GGT CAG GAA GTT CTG ATC CGT CTG TTC 
63 72 81 90 99 108 

TTT CAA CTT CGA CTG TAG GGC CCA CTG CCA GTC CTT CAA CAC TAG CCA CAC AAG 



KGHPETLEKFDKFKGGSG 
AAA GGT CAC CCG GAA ACC CTG GAA AAA TTC GAC AAA TTC AAA GGG GGG ACC GGA 
117 126 135 144 153 162 

TIT CCA GTC GGC CTT TGG GAC CTT TTT AAG CTG TTT AAC TTT CCC CCC TCG CCT 



A«cl Hindlll 
GAP* 
GGC CCG CCG TGA T 3 1 
171 

CCG CGC GGC ACT A 5* 
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Fig. 17: Mrol-Ascl-Hind III gene cassette encoding for 
segment BETA of human apomyoglobin 



T 0l C HLKSEDEMK.ASEDLKK 
TCC GGA CAC CTG AAA TCT GAA GAC CAA ATG AAA OCA TCT GAA CAC CTC AAA AAA 

AGG CCT GTG GAC TTT AGA CTT CTC CTT TAC TTT CGT AGA CTT CTG GAC TTT TTT 



HGATVLT ALGGILKKKGH 
CAC GGT GCT ACC GTT CTG ACC GCT CTG GGT GOT ATC CTG AAA AAA AAA GGT CAC 

63 72 81 90 99 

GTG CCA CGA TGG CAA GAC TGG CGA GAC CCA CCA TAG CAC TTT TTT TTT CCA GTG 



MEAEIKPLAQSHATKH KG 
CAC CAA GCT GAA ATC AAA CCG CTG GCT CAG TCT CAC GCT ACC AAA CAC AAA GGG 
117 126 135 144 153 lw 

GTG CTT CGA CTT TAG TTT GGC GAC CGA GTC AGA GTG CGA TGG TTT GTG TTT.CCC 



ABflX BlndlZZ 
G S G G A P * 
GGG AGC GGA GGC CCG CCG TGA T 3' 

171 180 
CCC TCG CCT CCG CGC GGC ACT A 5' 
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Fig. 18: Mrol-Ascl-Hind III gene cassette encoding for 
segment GAMMA of human apomyoglobin 



Mr ° I tpVKYLEFISEC IIQV 

iccloA atc ccc err aaa tac ctc gag rrc atc tct gaa tgc atc atc cac gtt 

AGO CCT TAG GGC CAA TTT ATC CAC CTC AAG TAG AGA CTT ACG TAG TAG CTC CAA 



CTG CAG TCT AAA CAC CCG GGT GAC TTC GGT GCT GAC GCT GAA GGT GCT ATC AAC 
GAC GTC AGA TTT GTC GGC CCA CTC AAG CCA CGA CTC CGA CTT CCA CGA TAC TTC 



AAA GCT CTG GAA CTC TTC CGT AAA GAC ATC ScT TCT AAC TAC AAA GAA CTC GGT 
TTT CGA GAC CTT GAC AAG GCA TTT CTC TAC CGA AGA TTC ATC TTT CTT GAC CCA 



117 

__ GAC 

PSQFQETF 

XbcI Hindlll 
PQGCGSCGAP* 
TTC CAG GGT GGG GGG AGC GGA GGC GCG CCG TCA T 3 ■ 

171 ISO 189 

AAG GTC CCA CCC CCC TCG CCT CCG CGC GGC ACT A 5' 
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Fig. 19: Peptide sequence of an endotoxin-neutralizing 
peptide as a functional domain 

11 .11 
RWKVRKSFFKL Q 



Fig. 20: N-terminal EcoRV-EcoRI cassette encoding an 
endotoxin-neutralizing peptide 



ECORV EC J >RI C . 
IMRWKVRKSFFKLQ EP 

5 • ATC ATC CCT l TGG AAA GTT CGT AAA TCT TTC TTC AAA CTC CAG GAA TTC 3 

9 18 27 36 45 

3« TAG TAC CCA ACC TTT CAA GCA TTT AGA AAG AAG TTT GAC GTC CTT AAG 5 



Fig. 21: C-terminal Ascl-Hindlll cassette encoding , an 
endotoxin-neutralizing peptide 



. _ Hlndlll 
A»cl * 
APRWKVRKSFFKLQ 
5 • GCG CCG CGT TOG AAA GTT CGT AAA TCT TTC TTC AAA CTG CAG TGA TAA 3 

9 18 27 36 4S 

3 ' CGC GGC GCA ACC TTT CAA GCA TTT AGA AAG AAG TTT GAC GTC ACT ATT 5 



Fig. 22 Ascl-HINDIII Gene cassette encoding a purification 
tail for IMAC 

AacI Bind III 

A PH HHHHH * * 
5 • GCG CCG CAC CAC CAC CAC CAC CAC TGA TAA 3 ' 

9 18 27 

3' CGC GGC GTG GTG CTG GTG GTG CAC ACT ATT 5' 



WO 96/13583 



PCT/EP95/04117 



15/15 

Fig. 23 Protein sequence of the platelet aggregation 
inhibitor decorsin as a functional domain 



1 11 21 31 

APRLPQCQGD DQEKCLCNKD ECPPGQCRFP RGDADPYCE 



Fig. 24 N-terminal EcoRV-EcoRI cassette encoding the 
platelet aggregation inhibitor decorsin 



EcoRV 

DIAPRLPQCQGDDQEKCL 
GAT ATC GCT CCG CGT CTG CCG CAG TGC CAG GGT GAC GAC CAG GAA AAA TGC CTG 
9 18 27 36 45 54 

CTA TAG CGA GGC GCA GAC GGC GTC ACG GTC CCA CTG CTG GTC CTT TTT ACG GAC 

CNKDECPPGQCRFPRCDA 
TGC AAC AAA GAC GAA TGC CCG CCG GGT CAG TGC CGT TTC CCG CGT GGT GAC GCT 
63 72 81 90 99 108 

ACG TTG TTT CTG CTT ACG GGC GGC CCA GTC ACG GCA AAG GGC GCA CCA CTG CGA 



EcoRX 

D P Y C E F 
GAC CCG TAC TGC GAA TTC 3' 

117 126 
CTG GGC ATC ACG CTT AAG 5' 



Fig. 2 5 C-terminal Ascl-Hindlll cassette encoding the 
platelet aggregation inhibitor decorsin 



ABC I 

APAPRLPQCQ GDDQEKCL 
CCG CCG GCT CCG CGT CTG CCG CAG TGC CAG GGT GAC GAC CAG GAA AAA TGC CTG 
12 21 30 39 48 57 

CGC GGC CGA GGC GCA GAC GGC GTC ACG GTC CCA CTG CTG GTC CTT TTT ACG GAC 

CNKDECPPGQCRFPRCDA 
TGC AAC AAA GAC GAA TGC CCG CCG GGT CAG TGC CGT TTC CCG CGT GGT GAC GCT 
66 75 84 93 102 111 

ACG TTG TTT CTG CTT ACG GGC GGC CCA GTC ACG GCA AAG GGC GCA CCA CTG CGA 

HindXIX 

D P Y C E 
GAC CCG TAC TGC GAA TGA TAA 3' 

120 129 
CTG GGC ATG ACG CTT ACT ATT 5 1 
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